Introduction

For this assignment, I choose the data set of Incarceration Trends Dataset from VERA. In all the data set it provide to me, I choose incarceration_trends.csv as my main source to analyze the unfair of incarceration. For recent year, people are pay much more attention on race equality in different aspects like working, education, or medical condition. For this research, I pay my attention to the situation of incarceration trends to study the specific problems in the system which we considered is fair to everyone as we considered before.

Summary Information

## ── Attaching packages ─────────────────────────────────────── tidyverse 1.3.1 ──
## ✔ ggplot2 3.3.6     ✔ purrr   0.3.4
## ✔ tibble  3.1.7     ✔ dplyr   1.0.9
## ✔ tidyr   1.2.0     ✔ stringr 1.4.0
## ✔ readr   2.1.2     ✔ forcats 0.5.1
## ── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ──
## ✖ dplyr::filter() masks stats::filter()
## ✖ dplyr::lag()    masks stats::lag()
## 
## 载入程辑包:'plotly'
## The following object is masked from 'package:ggplot2':
## 
##     last_plot
## The following object is masked from 'package:stats':
## 
##     filter
## The following object is masked from 'package:graphics':
## 
##     layout
## 
## 载入程辑包:'scales'
## The following object is masked from 'package:purrr':
## 
##     discard
## The following object is masked from 'package:readr':
## 
##     col_factor
## Rows: 153811 Columns: 121
## ── Column specification ────────────────────────────────────────────────────────
## Delimiter: ","
## chr   (5): state, county_name, urbanicity, region, division
## dbl (116): yfips, year, fips, total_pop, total_pop_15to64, female_pop_15to64...
## 
## ℹ Use `spec()` to retrieve the full column specification for this data.
## ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
## # A tibble: 1 × 6
##   mean_jail_rate mean_prison_rate mean_male_rate mean_female_rate
##            <dbl>            <dbl>          <dbl>            <dbl>
## 1           4.32             6.69           7.51             1.04
## # … with 2 more variables: mean_white_prison_rate <dbl>,
## #   mean_black_prison_rate <dbl>

From the summary information list I get from the dataset above, I found that during all the population, the population in Jail(mean rate is 4.318663%) is less than the population in Prison(mean rate is 6.689772%). When I focus on gender problems which also been mentioned by people more frequently recently, I found that in all population, no matter in Jail or Prison, the average rate of Male(7.514543%) is higher than average rate of Female(1.0408%). After that, I move to the race problems which this assignment most care about, and I found that the average rate of black people(27.47658%) is much higher than the average rate of white people(4.463364%) .

Variable comparison chart

## Rows: 153811 Columns: 121
## ── Column specification ────────────────────────────────────────────────────────
## Delimiter: ","
## chr   (5): state, county_name, urbanicity, region, division
## dbl (116): yfips, year, fips, total_pop, total_pop_15to64, female_pop_15to64...
## 
## ℹ Use `spec()` to retrieve the full column specification for this data.
## ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
## Warning: Removed 2 row(s) containing missing values (geom_path).
## Removed 2 row(s) containing missing values (geom_path).

For this two variable comparison chart, I compare the general situation of different gender between female and male in Prison. It is easy to find that the size of female population is relative lower than male population is the whole history in New York County.

Map Data

## Rows: 153811 Columns: 121
## ── Column specification ────────────────────────────────────────────────────────
## Delimiter: ","
## chr   (5): state, county_name, urbanicity, region, division
## dbl (116): yfips, year, fips, total_pop, total_pop_15to64, female_pop_15to64...
## 
## ℹ Use `spec()` to retrieve the full column specification for this data.
## ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.

In this map, I show the average value of Minority Proportion in different state to give a glimpse to the racial situation in the whole America. From this map, we can easily tell how serious the inequity of different is in different state of America.